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ABSTRACT 


ON  BERRY-ESSEEN  RATES  FOR  STATISTICAL 
FUNCTIONS,  WITH  APPLICATION  TO  L-ESTIMATES 

A parameter  expressed  as  a functional  T(F)  of  a distribution 

function  (d.f.)  F may  be  estimated  by  the  "statistical  function" 

T(F  ) based  on  the  sample  d.f.  F . Typically,  T(F  ) is  asymptoti- 
n n n 

cally  normal.  We  investigate  the  rate  of  this  convergence  by  utilizing 
the  von  Mises  (1947)  representation  to  express  T(Fn)  “ T(F)  as  an 
.pproxlm.ce  o-.tmtl.tlc  plus  R,,  s»d  spplylog  Che  Berry-Esseeu  rate 
0(n  ) established  for  U-statistics  by  Callaert  and  Janssen  (1978). 

This  essentially  reduces  the  problem  to  a handling  of  Rn-  We  carry  out 
this  method  for  linear  functions  of  order  statistics  ("L-estimates") 
and  obtain  results  competitive  with  Bjerve  (1977)  and  Helmers  (1977). 
Also,  we  briefly  indicate  the  application  of  the  method  to  M-estimates. 
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0.  Surmary.  Let  T(*)  be  a real-valued  functional  defined  on  dis- 
tribution functions  (d.f.'s).  For  a sample  X^ Xr  from  a d.f.  F, 

consider  estimation  of  the  "parameter"  T(F)  by  the  sample  analogue 

estimator  T(Fq)  based  on  the  usual  sample  d.f.  F^(x)  " n I(X^<x), 

2 

-“  < x < • . In  many  cases,  for  a suitable  constant  o (T,F),  the  d.f. 
u 

of  n [T(Fn>  - T(F)]/o(T,F)  converges  weakly  to  the  standard  normal 
d.f.  #.  In  Section  1 we  formulate  a general  approach  toward  investiga- 
tion of  the  rate  of  this  convergence.  Taking  two  terms  of  the 
von  Mises  (1947)  Taylor  expansion  for  statistical  functions,  we  represent 
T(Fr)  - T(F)  as  an  approximate  U-statistic  plus  a remainder  term  Rn> 
Under  appropriate  conditions,  we  may  dispense  with  R^,  switch  to  an 
exact  U-statistic,  and  exploit  the  Berry-Essden  rate  0(n  ) established 

for  U-statistics  by  Callaert  and  Janssen  (1978).  (On  the  basis  of  only 
one  term  from  von  Mises’  expansion,  the  method  typically  yields  a some- 
what weaker  rate.)  In  Section  2 the  method  is  applied  to  linear  functions 
of  order  statistics  ("L-estimates") , as  represented  by  functionals  of  the 
form  T(F)  ■ j^F  ^(u)J(u)du  . The  rate  0(n  **)  is  obtained  under  condi- 
tions on  J and  F competitive  with  Bjerve  (1977)  and  Helmers  (1977).  In 
Section  3 the  application  of  the  approach  to  M-estimates  is  sketched. 

1.  A general  approach.  Utilizing  the  notion  of  Taylor  expansion 
for  statistical  functions  introduced  by  von  Mises  (1947),  and  taking  two 
terms,  we  represent  T(?n)  - T(F)  as  V*  + remainder,  where 

K • 31  + ^ i(U-x)r«rn)|A^  . 
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Typically,  the  terms  in  may  be  represented  in  the  forms  /a(x)dFn(x) 

and  >j//B(x,y)dF  (x)dF  (y) , respectively.  In  this  case  V*  has  the 
n n n 

form  (1.1)  below,  with  h(x,y)  * *s[a(x)  + a(y)  + 8(x,y)]  . The  following 

theorem,  stated  for  this  setting  of  a statistical  function  T(F  ) 

n 

estimating  T(F),  is  in  fact  valid  in  connection  with  any  statistic 

T - T (X,,  ...,  X ) estimating  a parameter  T.,  but  depends  upon 
n n i n u 

finding  a suitable  function  b(x,y). 

THEOREM  1.1.  Suppose  that  T(Fn>  - T(F)  may  he  represented  as 
Vr  + R^,  where  is  given  by 

- n n 

(1.1)  V - n~*  [ l h(X.,X.)  , 

n i-1  j-1  1 J 

with  h(x,y)  symmetric  in  its  arguments  and  satisfying 
(1.2a)  EfI{h(X1,X2)}  - 0 , 

(1.2b)  EF|h(X1,X2)|3  < » , 

(1.2c)  EF|h(X1,X1)|3/2  < » , 

and  where,  for  some  A > 0 , 

(1.3)  P(|rJ  > An-1)  - 0(n_l5),  . 

Put  o2(T,F)  - 4VarF(g(X1)} , where  g(x)  ■ EF(h(x,X1)}  . Then 

(1.4)  supt|P(ni*[T(Fn)-T(F)]/o(T,F)  It)  - *(t)|  - 0(n-Js) , . 

He  obtain  Theorem  1.1  by  successive  application  of  the  following 
well-known  and  easily  proved  device. 
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LEMMA  1.1  Let  the  sequence  of  rv' a {CR  } satisfy 

(*)  supt|P(Cnit  - $(t)|  - 0(n  '**)  . 

Then,  for  any  sequence  of  rv's  (A^ } and  constant  B , 

(**)  suPj.lPC^+a^t)  - ♦(t)|  - 0(n_li)  + P(|an|>Bn_ls)  . 

For  the  first  application,  put  £ • nS^/ad.F)  and  A^  ■ nSwad.F)  . 

Then,  by  (1.3),  it  suffices  for  (1.4)  to  establish  (*) . Now,  for  the 

U-statiatic  of  Hoeffding  (1948)  associated  with  the  kernel  h,  namely 

Uq  - Jl<i<j<n  h^Xi,Xj)»  Callaert  and  Janssen  (1978)  establish 

(*)  for  C ■ nSj  / o(T,F) , under  the  conditions  (1.2a,b).  Therefore, 
n n 

to  complete  the  proof  of  Theorem  1.1,  it  suffices  to  show  that 
L -L  -L 

P(n^|Un  - V | > Bn  ^)  • 0(n  v for  some  constant  B.  This  is  established 
in  the  following  result. 

LEMMA  1.2.  Suppose  that  h(x,y)  is  symmetric  in  its  arguments 
and  satisfies  Eh2(X.,X2)  < - and  E|h(X1,X1)  | 3/2  < - . Then,  for 
B > 2|E{h(XrX2)  - h(X1,X1)  }|  , 

P(|Un  - Vn|  > Bn-1)  - o(n_Js)  , tt—  . 

PROOF.  Check  that  U - V - n-1(U  - W ) , where  W - n_1^“  ,h(X. ,X. ) 

n n n n n i“i  i 1 

Then,  for  B > 2|E(h(X1,X2)  - h(X1,X1)}|  , we  have 

P(|Un  - Vj  > Bn'1)  - P(|Un  - Wq|  > B) 

< P(|u  -W  - E{U  ) + E{W„  )|  > >jB) 

“ n n n n 

< P(|u  - E{U  )|  > B/4)  ♦ P(|W  - E{W  }|  > B/4)  . 

“ ■ n n n n 
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The  first  term  on  the  right  is  0(n  by  Chebyshev's  inequality  and  the 
well-known  rate  Var{Un)  ■ 0(n  ^) . For  the  second  term,  we  use  Theorem  4 
of  Bauai  and  Katz  (1965),  which  implies:  for  {Y^}  i.i.d.  with  EtY^}  - 0 
and  E | Yj  | r < • , where  r >_  1,  P ( | Y | > e)  ■ o(n1-r) , for  all  e > 0 . 

Ue  thus  apply  this  result  with  r - 3/2  . □ 

REMARK.  Mote  (using  (**))  that  even  if  the  optimum  rate  0(n  ) is 

not  known  for  R in  (1.3),  we  still  may  obtain  a (weaker)  Berry-Essden 
n 

rate  by  replacing  0(n  **)  in  (1.4)  by  0(P(  | R^  | > An-1)).  □ 

2.  l-eati/nates.  Consider  the  functional  T(F)  • J1 F 1(u)J(u)du 

and  the  corresponding  L-estimate  T(F  ) . Implementing  the  approach  of 

n 

Section  1,  we  find  (under  regularity  conditions  on  J) 

V*  - -f[F  (x)  - F(x)  )J°F(x)dx  - *jf(F  (x)  - F(x)  ]2J '<>F(x)dx 
n J n J n 

—00  —00 

and  thus  V*  ■ of  form  (1.1)  with  h(x,y)  ■ 1s[u(x)  + a(y)  + 6(x,y)], 
where 

ao 

(2.1a)  a(x)  - - | [I(x  < t)  - F(t) ]J°F(t)dt 

and 

OD 

(2.1b)  6(x,y)  - - J Cl(x  < t)  - F(t)  ][I(y  it)  - F(t)  ]J'<>F(t)dt  . 

Writing  K(u)  - j^J (u)du , we  obtain  by  integration  by  parts  that 

T(G)  - T(F)  - [K»G(x)  - K«F(x) ]dx  . Therefore,  for  R * T(F  ) - 

_«  n n 

T(F)  - V we  obtain 
n 


where 


W„  _ “ KoG  - K«F  - (J°F)  (G-F)  - ii(J'®F)  (G-F)2  . 

G,F 

Having  found  h and  heurlstlcally,  we  now  rigorously  establish  two 

Berry-Essden  theorems  for  L-estimates  by  introducing  sets  of  conditions  on 

J and  F under  which  h and  R are  well-defined  and  satisfy  the  re- 

n 

quirements  of  Theorem  1.1.  The  relevant  asymptotic  variance  parameter  is 


a2(J,F) 


J°F(x)J°F(y)[F(min(x,y))  - F(x)F(y) ]dxdy  . 


THEOREM  2.1.  Suppose  that  J vanishes  outside  an  interval  Ca,b]  , 

0 < a < b < 1,  and  that  J'  exists  and  satisfies  a Lipschitz  condition 
of  order  6 > 0 on  an  open  interval  containing  [a,b].  Assume 
o2 (J,F)  > 0 . Then 

(2.3)  supt|P(nJs[T(Fn)  - T(F)]/o(J,F)  It)  - 4>(t)  | - 0(n->s) , n~>  . 

THEOREM  2.2.  Suppose  that  J'  exists  and  satisfies  a Lipschitz  con- 
dition  of  order  6 > 1/3  on  (0,1).  Assume  that  EpI^jJ  < " and  that 
0 < o2(J,F)  < ».  Then  (2.3)  holds. 

REMARKS.  (1)  Theorem  2.1  requires  that  J be  trimmed  away  from 

2 

0 and  1,  but  Imposes  no  conditions  on  F other  than  o (J,F)  > 0. 

Bjerve  (1977),  also  for  the  case  of  J trimmed,  obtains  (2,3)  requiring 
only  that  J be  Lipschitz  of  order  1 on  an  open  interval  containing 
[a,b],  but  furthermore  requiring  that  F-*  possess  a second  derivative 
also  satisfying  such  a Lipschitz  condition. 
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(ii)  Theorem  2.2  allows  J to  have  support  (0,1)  but  Imposes  a 
standard  moment  condition  on  F.  Helmers  (1977)  proves  a similar  theorem. 
He  allows  J'  not  to  exist  at  finitely  many  points  but  requires  J'  to 
be  Llpschltz  of  order  > h on  the  open  intervals  where  it  exists  and 

F ^ to  be  Lipschitz  of  order  > on  neighborhoods  of  the  points  where 
J'  does  not  exist.  (The  additional  restriction  J|j'|dF  ^ < « required 
in  Helmers  (1977a)  is  eliminated  in  Helmers  (1977b).) 

(iii)  Theorems  2.1  and  2.2  remain  valid  with  T(F  ) replaced  in 

n 

(2.3)  by  the  closely  related  statistic  T - n ^7”  , J(i/(n+l))X  . , 

n l“i  m 

where  X . < ...  < X denote  the  ordered  sample  values.  To  see  this, 
nl  — — nn 

write  T(F  ) •»  7?  ,[K(i/n)  - K( (i-1) /n) JX  , and  apply  standard  arguments, 
n i“i  nl 

For  example,  in  the  case  of  Theorem  2.2,  we  use  boundedness  of  J'  to 

write  |T(F  ) - T I < Mn  ^7”  |X. I for  a constant  M.  Put 
1 n n1  — ‘*i«ll  i1 

A - n**[T(F  ) - T ] and  obtain  P( f A ( > 2MElx.  ( n_>5)  < P(n_17"  Jx.l  > 
n n n 1 n 1 ' 1'  — Li«l'  i1 

2E|Xi|)  < P(|n“1Xj-1|xi|  - E|X1||  > E|X1|)  - O^"1)  by  Chebyshev's 

2 

inequality,  since  E X^  < ® , Then  apply  Lemma  1.1.  0 

PROOF  OF  THEOREM  2.1.  First  we  utilize  the  assumption  that  J is 
trimmed.  Let  0 < e < min{a,l-b}  . Then  there  exist  A,B  such  that 
- • < A < F ^(a-c)  < F ^(b+e)  < B < « . Hence,  defining 
I I q I I „ ■ supx|q(x)|,  we  have  that  | |G-F| | ^ < e implies  ^G  F(x)  - 0 
for  x 4 CA.B]  . Therefore,  defining 

B 

Rn(A»B)  “ ~ | wp  F(*>d*  » 

A n’ 

we  have 

P(|Rn|  > Cn-1)  £ P(|Rn(A,B)|  > Cn"1)  + P(| |Fn  - F| 1^  > e)  . 
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Now  apply  the  Llpschltz  condition  on  J'  , whereby  |j'(u^)  - J'(u2)|  ± 
D|u^  - u^l  , to  obtain 

lRn(A*B)  I ^ 4(B-A)D  IIf^-fI  |f+<s. 

where  6 > 0 . Thus,  with  - 2C/(B-A)D  , 

(2.4)  P(|Rn|  > Cn'1)  < P(n||Fn-F||*+6>C1)  + PCllF^Fll,  > e)  . 

I A 

We  now  apply  the  inequality  P(n*|  | F^— F | 1^  _>  d)  <_  DQexp(-2d  ;,  d > 0, 

where  Dq  is  a constant,  due  to  Dvoretzky,  Kiefer  and  Wolfowitz  (1956). 

-k 

It  is  readily  seen  that  the  terms  on  the  right  in  (2.4)  are  each  0(n  ), 

so  that  R satisfies  (1.3). 
n 

The  required  properties  of  h are  obtained  easily.  Restricting 

the  range  of  integration  in  (2.1)  to  the  interval  [A,fJ,  finiteness  of 

moments  follows.  Further,  interchanging  expectation  and  integration  by 

Fubini's  theorem,  we  find  E{a(X^) ) ■ 0,  E{f5(x,X^)  } ■ 0 and  thus  also 

E(8(X1,X2) } - 0 and  g(x)  - E{h(x,X1)  } - ija(x) . Thus  a2(T,F)  of 

Theorem  1.1  is  given  by  Ea2(X^)  ■ o2(J,F)  . □ 

In  proving  Theorem  2.2,  we  will  need  a property  of  | | F — F J | , 

n 1^2 

where  ||*||  denotes  the  L_-norm,  ||q||T  ■ C/q2(x)dx]ls  . 

L2  z L2 

, lk 

LEMMA  2.1.  Let  k be  a positive  integer  and  assume  E_ | X-  ] < « . 

r i 

Then 


<2*5>  E{  | |F  -F|  |.2k>  - 0(n“k)  , ir«  . 

n l 2 
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PROOF.  Put  Y1(t)  - I(Xt  £ t)  - F(t) , 1 < i<  n . Then 


(2.6)  E{||Fn-F||2k}  . n^K  l l E{  n 


-2k 


n n n n 


V1  V1  V1  V1  ‘-1 


Y (t)Y  (t)dtl 
1H 


Since  E|X^|  < «,  integration  by  parts  yields  the  inequality 

(2,7)  JlKX^t)  - F(t) | d t £ |xx|  + e|x1|  . 


Consequently,  the  expectation  term  on  the  right  in  (2.6)  is  finite  since 


E | Xx | < » and,  by  Fubini's  theorem. 


(2.8)  e{  n [y  (t)Y  (t)dt}  - f . - . [e{  n Y (t. )Y.  (t,)}dt.  ...  dt.  . 
l«lJ  ] J *■  *■  1 *■ 


By  independence  of  Y (s.),  ....  Y (s  ) for  any  s. s , we  have 

■Li.  n n i n 


- Vtk)V'kH  ” 0 


except  possibly  in  the  case  that  each  index  in  the  list  i^ , , . . . ,i^, 

appears  at  least  twice.  In  this  case  the  number  of  distinct  elements 


in  the  set  (i^»j^,  • . . *ijt»Jjt  J is  k . It  follows  that  the  number  of 


ways  to  choose  i^tJ^» • • • such  that  the  expectation  in  (2.8)  is 


nonzero  is  0(n  ) . Thus  the  number  of  nonzero  terms  in  the  summation 


in  (2.6)  is  0(n  ) . Q 


PROOF  OF  THEOREM  2.2.  Applying  the  Lip  condition  on  J",  we  obtain 


lRnl  ±*>jlFnOO  - F(x)|2+6dx  < %D||Fn-F||i  .||Fn-F| 


L2  * 
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Thus,  for  any  A > 0, 


P(|Rj  > An’1)  < P(B||Fn-F||*||Fn-F||£  > 2A/D) 

< P(n1/6| |Fn-F| |*  > 2A/D)  + P(n5/6| | Fn~F | > 1) 
For  6 > 1/3,  the  first  term  above  is  0(n  ) by  the  DKW  inequality 

_Sj 

used  in  the  proof  of  Theorem  2.1.  The  second  term  above  is  0(n  ) 

by  Lemma  2.1,  applied  for  k ■ 3.  Therefore,  R^  satisfies  (1.3). 

The  required  properties  of  h are  established  in  similar  fashion 
as  in  the  proof  of  Theorem  2.1.  In  this  connection,  the  inequality  (2.7) 
is  useful  in  proving  finiteness  of  moments.  0 

REMARK.  It  is  evident  from  the  preceding  proof  that  under  a higher 
moment  assumption  E|x. |V  < * , where  v is  an  integer  > 3,  the  Lip 
condition  on  J'  may  be  relaxed  to  order  6 > 1/v  . □ 


3.  M-e8timatea.  Let  ty  be  a function  such  that  the  parameter  of 

interest  T(F)  may  be  defined  as  a solution  of  the  equation  *p(t)  ■ 0, 

where  Xp(t)  ” ,A|i(x-t)dF(x)  . Thus  T(*)  represents  an  "M-functional 

with  respect  to  if  X^(T(F))  ■ 0,  all  F . In  the  following  let  us 

put  T for  T(F  ) and  t for  T(F)  . 
r n n o 

The  approach  of  Section  1 leads  to 


and 


ot(x) 


^(x-to) 

V(t  ) 

F O 
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_2a£x)  r^/  t \ 

X '(t  ) LV  V 

r o 


+ Xp'(to)  - >*XF"(t0)a(y)]  . 


6(x,y) 


according  as 


Defining  h(t)  - CA^t)  - Ap( tQ)  3/ (t-tQ)  or  AF'(tQ)  . 

t 4 t or  t ■ t , we  obtain  for  R ■ T - t - V 
o o n n o n 


/C«J)(x-T  )-«x-t  ) ]d[F  (x)-F(x)  ] 

R S 2 2 + 

11  h<V 


A '(t  ) ~ h(T  ) ’Kx-t0)dCFn(x)-F(x)] 
r O n ' 


/*(x-t  )d[F  (x)-F(x)]  "(x-t  )d[F  (x)-F(x)] 

o n o n 

CA_'(t  )]2 

r o 


V-(to){/*(x"to)dCFn<x)“F(x)]  f 


2[AF'(to)r 


A brute  force  treatment  of  R leads  to 

n 


iRni  ±s.  iiFn-Fn: 


under  moderate  restrictions  on  y , i|>',  {p",  and  on  X '(t),  A "(t) 

F F 

a&d  Ap"'(t)  for  t in  a neighborhood  of  tQ.  An  application  of  the 

DKW  inequality  cited  in  Section  2 then  yields  (1.3)  for  R . Condition 

n 

(1.2)  is  easily  verified.  For  other  discussion  of  the  Berry-Ess£en  rate 
for  M-estimates,  see  Bickel  (1974). 
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A parameter  expressed  as  a functional  T(F)  of  a distribution  function  (d.f.) 

F may  be  estimated  by  the  •’statistical  function"r(Fn)  based  on  the  sanple  d.f. 
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Fn^  Typically,  T(Fn)  is  asymptotically  normal.  We  investigate  the  rate  of  this 
convergence  by  utilizing  the  von  Mises  (1047)* representation  to  express  T(Fn)  - T(F) 
as  an  approximate  U-statistic  plus  R^,  and  applying  the  Berry-Essden  rate  0(rf*) 
established  for  U-statistics  by  Callaert  and  Janssen  (W$^  This  essentially  reduces 
the  problem  to  a handling  of  R^.  We  carry  out  this  method  for  linear  functions  of 
order  statistics  (’’L-estimates"}  and  obtain  results  competitive  with  Bjerve  (±977} 
and  Helmers  (±$77) . Also,  we  briefly  indicate  the  application  of  the  method  to 
M-estimates.  ^ 


